CPU TLB Stream Buffers L1 Cache Stream Allocator
Abstract
We explore a variation of Jouppi's stream buffers scheme, which reduces memory stall time by prefetching lines on cache misses. In Jouppi's scheme, the stream buffers are off-chip, so a penalty is incurred even when the required line is available in a stream buffer. We propose moving the stream buffers on-chip to reduce this penalty. In addition, a line that hits in a stream buffer is not moved into the cache immediately unless the data is identified as exhibiting temporal locality. We use a stream buffer allocator that integrates the filtering and stride detection mechanisms used by Palacharla. We present trace-driven simulation results for some of the SPEC92 benchmarks that demonstrate the effectiveness of our scheme. The results show improvements over a normal cache in all cases, and for some benchmarks our scheme reduces the number of off-chip memory fetches.
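The mechanism described in the abstract can be illustrated with a small sketch. This is not the authors' simulator: the class names, the buffer depth, the fully associative LRU cache, the "same stride on two consecutive misses" allocation filter, and the "line was referenced before" test for temporal locality are all assumptions chosen to keep the example short. The abstract does not say how temporal locality is identified, and Palacharla's actual filtering and stride detection are more elaborate.

```python
from collections import OrderedDict, deque


class StreamBuffer:
    """FIFO of prefetched line addresses that follow a fixed stride."""

    def __init__(self, start_line, stride, depth):
        self.stride = stride
        self.depth = depth
        self.next_line = start_line
        self.lines = deque()

    def refill(self):
        """Top the FIFO up to `depth`; return how many lines were fetched."""
        fetched = 0
        while len(self.lines) < self.depth:
            self.lines.append(self.next_line)
            self.next_line += self.stride
            fetched += 1
        return fetched


class Simulator:
    """Hypothetical model: LRU cache plus on-chip stream buffers."""

    def __init__(self, cache_lines=4, n_buffers=2, depth=4):
        self.cache = OrderedDict()              # line -> True, in LRU order
        self.cache_lines = cache_lines
        self.buffers = deque(maxlen=n_buffers)  # oldest buffer is replaced
        self.depth = depth
        self.last_miss = None                   # allocator state (assumed)
        self.last_stride = None
        self.history = set()                    # lines referenced so far
        self.stats = {"cache_hit": 0, "buffer_hit": 0, "miss": 0, "offchip": 0}

    def _install(self, line):
        self.cache[line] = True
        self.cache.move_to_end(line)
        if len(self.cache) > self.cache_lines:
            self.cache.popitem(last=False)      # evict least recently used

    def _maybe_allocate(self, line):
        # Assumed filter: allocate only when two consecutive misses
        # show the same non-zero stride.
        stride = None if self.last_miss is None else line - self.last_miss
        if stride and stride == self.last_stride:
            buf = StreamBuffer(line + stride, stride, self.depth)
            self.stats["offchip"] += buf.refill()
            self.buffers.append(buf)
        self.last_stride = stride
        self.last_miss = line

    def access(self, line):
        """Reference one cache line; return 'cache_hit', 'buffer_hit' or 'miss'."""
        if line in self.cache:
            self.cache.move_to_end(line)
            kind = "cache_hit"
        else:
            buf = next((b for b in self.buffers
                        if b.lines and b.lines[0] == line), None)
            if buf is not None:
                buf.lines.popleft()
                self.stats["offchip"] += buf.refill()
                # Assumed temporal-locality test: promote to the cache
                # only if this line has been referenced before.
                if line in self.history:
                    self._install(line)
                kind = "buffer_hit"
            else:
                self.stats["offchip"] += 1
                self._install(line)
                self._maybe_allocate(line)
                kind = "miss"
        self.stats[kind] += 1
        self.history.add(line)
        return kind


if __name__ == "__main__":
    sim = Simulator()
    print([sim.access(line) for line in range(10)])
    print(sim.stats)
```

Under these assumptions, a sequential walk over lines 0 to 9 misses three times while the allocator confirms the stride, and the remaining seven references are served from the stream buffer without being installed in the cache, so streaming data does not displace cached lines.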
Similar resources
Predicting and Prefetching TLB Entries from Irregular Access Streams
This thesis describes the design and evaluation of the Irregular Translation Buffer (ITB), which is a prefetcher for translation lookaside buffer (TLB) entries that is based on the Irregular Stream Buffer (ISB) introduced by Jain and Lin [1]. The main idea is to take the ISB's improvements in prefetching cache lines from irregular access streams and use them to prefetch entries from irregular access s...
Stride-directed Prefetching for Secondary Caches
This paper studies hardware prefetching for second-level (L2) caches. Previous work on prefetching has been extensive but largely directed at primary caches. In some cases only L2 prefetching is possible or is more appropriate. By studying L2 prefetching characteristics we show that existing stride-directed methods [1, 8] for L1 caches do not work as well in L2 caches. We prop...
Further Cache and TLB Investigation of the RAMpage Memory Hierarchy
The RAMpage memory hierarchy is an alternative to the traditional division between cache and main memory: main memory is moved up a level and DRAM is used as a paging device. Earlier RAMpage work has shown that the RAMpage model scales up better with the growing CPU-DRAM speed gap, especially when context switches are taken on misses. This paper investigates the effect of more aggressive first-...
Towards Smart Cache Management for Ontology Based, History-Aware Stream Reasoning
Stream reasoning is an exciting multidisciplinary research area that combines stream processing and semantic reasoning. Its goal is to not only process a dynamic data stream, but also to extract explicit and implicit information on-the-fly. One of its challenges is managing history awareness: how much and which historical data should be held and for how long as we continuously query and reason ...
The Impact of Timeliness for Hardware-based Prefetching from Main Memory
Among the techniques to hide or tolerate memory latency, data prefetching has been shown to be quite effective. However, this efficiency is often limited to prefetching into the first-level cache. With more aggressive architectural parameters in current and future processors, prefetching from main memory to the second-level (L2) cache becomes increasingly more important. In this paper, we exami...
Publication date: 1996